Sociology 229: Advanced Regression Models
Short Assignment
#1: Multinomial Logistic Regression
Due: Start of class (9:00) January 19
This assignment requires a dataset on the course website entitled “Assignment 1 Multinomial Data.dta”. The dataset includes information on approximately 1,200 Protestants respondents from the GSS. The dataset includes a variable “religid” which indicates whether the respondent is a “fundamentalist”, “evangelical”, “mainline”, or “liberal” Protestant.
Question 1: Based on the simplest model (with education only), which type of protestants is most educated? Which is least? Which differences are statistically significant (compared to mainline protestants)? Can you draw statistical inferences about the difference between fundamentalist and evangelical Protestants from this analysis?
Question 2: Interpret the coefficient for gender (female dummy) on the choice between mainline and fundamentalist Protestantism. Discuss the raw coefficient (which indicates direction), the relative risk ratio (which is analogous to an odds ratio), and the % difference in relative risk. Do the results differ when you shift the reference outcome from mainline to fundamentalist?
Question 3: Comment briefly on the consequences of changing the reference outcome from mainline to fundamentalist Protestants: Generalizing from your experience from Question 2, what happens to coefficients for mainline vs. fundamentalist Protestants when you swap the reference groups? Also: Which contrasts can you directly examine once fundamentalist Protestants are the reference group (that could not be easily made when mainline protestants were the reference group)? Which contrasts can no longer be directly assessed?
Question 4: Comment briefly (2-3 sentences) on the binary logistic regression results. Were they similar to the multinomial results overall?
Turn in the following: